scientific collaboration
Towards an Introspective Dynamic Model of Globally Distributed Computing Infrastructures
Kilic, Ozgur O., Park, David K., Ren, Yihui, Korchuganova, Tatiana, Vatsavai, Sairam Sri, Boudreau, Joseph, Chowdhury, Tasnuva, Feng, Shengyu, Khan, Raees, Kim, Jaehyung, Klasky, Scott, Maeno, Tadashi, Nilsson, Paul, Outschoorn, Verena Ingrid Martinez, Podhorszki, Norbert, Suter, Frédéric, Yang, Wei, Yang, Yiming, Yoo, Shinjae, Klimentov, Alexei, Hoisie, Adolfy
Large-scale scientific collaborations like ATLAS, Belle II, CMS, DUNE, and others involve hundreds of research institutes and thousands of researchers spread across the globe. These experiments generate petabytes of data, with volumes soon expected to reach exabytes. Consequently, there is a growing need for computation, including structured data processing from raw data to consumer-ready derived data, extensive Monte Carlo simulation campaigns, and a wide range of end-user analysis. To manage these computational and storage demands, centralized workflow and data management systems are implemented. However, decisions regarding data placement and payload allocation are often made disjointly and via heuristic means. A significant obstacle in adopting more effective heuristic or AI-driven solutions is the absence of a quick and reliable introspective dynamic model to evaluate and refine alternative approaches. In this study, we aim to develop such an interactive system using real-world data. By examining job execution records from the PanDA workflow management system, we have pinpointed key performance indicators such as queuing time, error rate, and the extent of remote data access. The dataset includes five months of activity. Additionally, we are creating a generative AI model to simulate time series of payloads, which incorporate visible features like category, event count, and submitting group, as well as hidden features like the total computational load-derived from existing PanDA records and computing site capabilities. These hidden features, which are not visible to job allocators, whether heuristic or AI-driven, influence factors such as queuing times and data movement.
- North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
- North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
- North America > United States > California > San Mateo County > Menlo Park (0.04)
Proximity Matters: Analyzing the Role of Geographical Proximity in Shaping AI Research Collaborations
Toobaee, Mohammadmahdi, Schiffauerova, Andrea, Ebadi, Ashkan
The role of geographical proximity in facilitating inter-regional or inter-organizational collaborations has been studied thoroughly in recent years. However, the effect of geographical proximity on forming scientific collaborations at the individual level still needs to be addressed. Using publication data in the field of artificial intelligence from 2001 to 2019, in this work, the effect of geographical proximity on the likelihood of forming future scientific collaborations among researchers is studied. In addition, the interaction between geographical and network proximities is examined to see whether network proximity can substitute geographical proximity in encouraging long-distance scientific collaborations. Employing conventional and machine learning techniques, our results suggest that geographical distance impedes scientific collaboration at the individual level despite the tremendous improvements in transportation and communication technologies during recent decades. Moreover, our findings show that the effect of network proximity on the likelihood of scientific collaboration increases with geographical distance, implying that network proximity can act as a substitute for geographical proximity.
- Europe (0.28)
- North America > Canada > Ontario > Toronto (0.14)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- (6 more...)
- Government > Regional Government (0.68)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Convolutional Neural Networks for signal detection in real LIGO data
Zelenka, Ondřej, Brügmann, Bernd, Ohme, Frank
Searching the data of gravitational-wave detectors for signals from compact binary mergers is a computationally demanding task. Recently, machine learning algorithms have been proposed to address current and future challenges. However, the results of these publications often differ greatly due to differing choices in the evaluation procedure. The Machine Learning Gravitational-Wave Search Challenge was organized to resolve these issues and produce a unified framework for machine-learning search evaluation. Six teams submitted contributions, four of which are based on machine learning methods and two are state-of-the-art production analyses. This paper describes the submission from the team TPI FSU Jena and its updated variant. We also apply our algorithm to real O3b data and recover the relevant events of the GWTC-3 catalog.
- North America > United States (0.14)
- Asia > Japan (0.04)
- Europe > Germany > Lower Saxony > Hanover (0.04)
- (11 more...)
Internationalizing AI: Evolution and Impact of Distance Factors
Tang, Xuli, Li, Xin, Ma, Feicheng
International collaboration has become imperative in the field of AI. However, few studies exist concerning how distance factors have affected the international collaboration in AI research. In this study, we investigate this problem by using 1,294,644 AI related collaborative papers harvested from the Microsoft Academic Graph (MAG) dataset. A framework including 13 indicators to quantify the distance factors between countries from 5 perspectives (i.e., geographic distance, economic distance, cultural distance, academic distance, and industrial distance) is proposed. The relationships were conducted by the methods of descriptive analysis and regression analysis. The results show that international collaboration in the field of AI today is not prevalent (only 15.7%). All the separations in international collaborations have increased over years, except for the cultural distance in masculinity/felinity dimension and the industrial distance. The geographic distance, economic distance and academic distances have shown significantly negative relationships with the degree of international collaborations in the field of AI. The industrial distance has a significant positive relationship with the degree of international collaboration in the field of AI. Also, the results demonstrate that the participation of the United States and China have promoted the international collaboration in the field of AI. This study provides a comprehensive understanding of internationalizing AI research in geographic, economic, cultural, academic, and industrial aspects.
- Asia > Japan (0.05)
- Asia > China > Hubei Province > Wuhan (0.05)
- Europe > United Kingdom (0.04)
- (13 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Banking & Finance (1.00)
- Education (0.67)
- Health & Medicine > Epidemiology (0.46)
Pull US AI Research Out of China
This piece was updated Aug. 11 to add information from Google. Recently, the Biden administration and a host of allies called out China for its massive Microsoft Exchange hack (among others), and threatened strengthened cyber defense measures and continued exposure of the PRC's malicious cyber activity. But despite a lot of hard talk about securing America's cyber defenses, action is wanting, and the government has failed to address a glaring boon to the PRC's cyber capabilities: our own companies' AI research centers in China. Housing the AI research labs of America's cutting-edge tech companies in authoritarian China was never a good idea. But given that the Chinese government uses foreign tech companies to help find and exploit security vulnerabilities, and that it is claiming ever more control over tech companies' operations and data, it looks more objectionable than ever. AI is an increasingly crucial element of cyber security and hacking, and Xi Jinping's China has demonstrated time and time again that China's high-tech sector serves the CCP, which sees AI technology in particular as a core tool of its future autocratic rule.
- North America > United States (1.00)
- Asia > China > Beijing > Beijing (0.05)
Disentangling homophily, community structure and triadic closure in networks
Network homophily, the tendency of similar nodes to be connected, and transitivity, the tendency of two nodes being connected if they share a common neighbor, are conflated properties in network analysis, since one mechanism can drive the other. Here we present a generative model and corresponding inference procedure that is capable of distinguishing between both mechanisms. Our approach is based on a variation of the stochastic block model (SBM) with the addition of triadic closure edges, and its inference can identify the most plausible mechanism responsible for the existence of every edge in the network, in addition to the underlying community structure itself. We show how the method can evade the detection of spurious communities caused solely by the formation of triangles in the network, and how it can improve the performance of link prediction when compared to the pure version of the SBM without triadic closure.
- Europe > United Kingdom (0.14)
- Europe > Austria > Vienna (0.14)
- Europe > Spain > Galicia > Madrid (0.04)
- (7 more...)
- Health & Medicine (1.00)
- Information Technology (0.69)
- Leisure & Entertainment > Sports (0.68)
- Education > Educational Setting > K-12 Education (0.47)
- Information Technology > Data Science (1.00)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Communications > Networks (1.00)
- (2 more...)